Accession Number | TCMCG075C18439
gbkey | CDS
Protein Id | XP_007022856.1
Location | join(224214..224635,224951..225293,225631..225855,226157..226385,227174..227352,228209..228274,229074..229151)
Gene | LOC18595034
GeneID | 18595034
Organism | Theobroma cacao
Length | 513aa
Molecule type | protein
Topology | linear
Data_file_division | PLN
dblink | BioProject:PRJNA341501
db_source | XM_007022794.2
Definition | PREDICTED: U3 snoRNP-associated protein-like EMB2271 isoform X2 [Theobroma cacao]
CDS: ATGAAGAGCAAGAACGAGAACAAGAGAGTGTCGGCCCCAAAAGGAGGGCAAAAAGGCGGGAAATTTTCCATGAATAATGGCCCATTCTTCGAAGCGGAGAGCAAGAAGCGGCGGAAAACGGAGTACGAGGACGATGACATTGAGTCAGGCGACTCAGAAGAGGAGGCAGAGATTCTCGGGGGTTTTGACGCAAGCGGGGGAGTGGAGGAGGATGAGGATGATGTGGAGACCCCCGGTGAGATTCGAAAAAGGGTGGCGACGGAGCTTTTGGATAAGATGCGGGCAATAGCAAGGAAGGAGAAAGAAGATGAGGATGAGGATTGGGAGGAAGGGGAAGAGGCCAGGGATTCGGTTGTGGCGAAGATTTTGCAGCAGGAGCAGCTTGAGGAGAGCGGCAGGGCAAGGAGAGTCCTTGCTTCCAGACTTAAAAAACCAGAAACTACTGATGGATTTAAGGTCTTAGTGAAGCACCAACAATCTGTTACTGCTGTAGCTCTCTCTGATGATGACTTGAAGGGCTTTTCAGCATCCAAAGACGGTACTATCTTACAATGGGATGTAGAAAGTGGCAAAAGTGCAAAGTACCAATGGCCTAGTGAAGATATTCTTAAGAGTCATGGGGCCAAGGATCCACGTGGTCGAGTTAAAAAACATAGTAGAAATGTCTTAGCATTGGCTGTTAGTTCTGATGGGCGATATTTGGCAAGTGGAGGCTTGGACCGCCATGTTCATCTGTGGGACATTCGTACAAGAGAGCATTTACAGGCATTTCCAGGTCATCAAAAACCTGTTTCATGTTTAAGTTTTAGGCAAGGCACGGCAGATCTTTTTTCTGGATCATTTGATCGAACAGTCAAGTATTGGAATATGGAAGACAGAGCTTACATTGACACAATATATGGTCATGAAAGTGAAGTATTGACGCTTGATTGCCTAAGGAAAGAACGAGTGTTGACTGTTGGACGTGATCGGTGTATGATGTTGTTTAAGGTCCTTGACCAGTCACGGTTGGTATTTCGTCCTCCACCATCATCTTTGGAATGCTGCTGCTTTGTTAACAATGATGAATTCTTATCTGGCTCGGACGATGGAAGTATTGAACTTTGGAGCATTGGAAGAAAGAAACCTGTATACATTGTGAAGAATGCTCATGCTCTGCTGCCTGCCTGTCAGAATGTTGAACAAAAAGGCAGTGAAAAAATCCCCAATATCCGTTTAGAGAACGGTGATCACAAAATTGAGAGCTATAGTAGTTCATCAACATATTCCTGGGTCAGTTCAATCACTGTATGTAGAGGCAGTGACCTTGCTGCATCAGGAGCTGGTAATGGCTGCATTCAATTATGGGCCATTGAGAGTGGGAGAAAGGACATCCAGCCCTTATATGGCATTCCCTTGGTGGGATTTGTTAATTCCCTGGCTTTTGCAAATTCTGGACAGTTTCTAATTGCTGGAGTTGGGCAGGAACCTAGACTAGGAAGATGGGGACGCCATCCAACTGCTCGGAATGGAGTTGCAATTCAATCATTGAAGCTCTTGTAA |
Protein: MKSKNENKRVSAPKGGQKGGKFSMNNGPFFEAESKKRRKTEYEDDDIESGDSEEEAEILGGFDASGGVEEDEDDVETPGEIRKRVATELLDKMRAIARKEKEDEDEDWEEGEEARDSVVAKILQQEQLEESGRARRVLASRLKKPETTDGFKVLVKHQQSVTAVALSDDDLKGFSASKDGTILQWDVESGKSAKYQWPSEDILKSHGAKDPRGRVKKHSRNVLALAVSSDGRYLASGGLDRHVHLWDIRTREHLQAFPGHQKPVSCLSFRQGTADLFSGSFDRTVKYWNMEDRAYIDTIYGHESEVLTLDCLRKERVLTVGRDRCMMLFKVLDQSRLVFRPPPSSLECCCFVNNDEFLSGSDDGSIELWSIGRKKPVYIVKNAHALLPACQNVEQKGSEKIPNIRLENGDHKIESYSSSSTYSWVSSITVCRGSDLAASGAGNGCIQLWAIESGRKDIQPLYGIPLVGFVNSLAFANSGQFLIAGVGQEPRLGRWGRHPTARNGVAIQSLKLL |
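
The Location field and the Length field can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the join(...) segments are 1-based inclusive coordinates on a single strand and that the last codon of the CDS is a stop codon. The function names parse_join and spliced_length are hypothetical helpers introduced only for this illustration.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(224214..224635,224951..225293,225631..225855,"
    "226157..226385,227174..227352,228209..228274,229074..229151)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a join(...) location string.

    Assumes simple 'start..end' segments with 1-based inclusive coordinates.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Total nucleotide length of all segments once joined."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_nt = spliced_length(segments)

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(cds_nt, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_aa = codons - 1

print(f"exons: {len(segments)}")
print(f"CDS length: {cds_nt} nt, remainder mod 3: {remainder}")
print(f"expected protein length: {protein_aa} aa")
```

Under these assumptions the seven segments sum to 1542 nt, that is 514 codons, which matches the 513aa Length field plus one stop codon.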